Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Efficient Vision-Language Understanding
# Efficient Vision-Language Understanding
Tinyllava Phi 2 SigLIP 3.1B
Apache-2.0
TinyLLaVA-Phi-2-SigLIP-3.1B is a small-scale large multimodal model with 3.1B parameters, combining the Phi-2 language model and SigLIP vision model, outperforming some 7B models.
Image-to-Text
Transformers
T
tinyllava
4,295
16
Featured Recommended AI Models
Empowering the Future, Your AI Solution Knowledge Base
English
简体中文
繁體中文
にほんご
© 2025
AIbase